Regression Modeling for Spherical Data via Non-parametric and Least Square Methods
نویسندگان
چکیده مقاله:
Introduction Statistical analysis of the data on the Earth's surface was a favorite subject among many researchers. Such data can be related to animal's migration from a region to another position. Then, statistical modeling of their paths helps biological researchers to predict their movements and estimate the areas that are most likely to constitute the presence of the animals. From a geometrical view, spherical data are points that take their values on the surface of a unit sphere. There are many methods to fit a curve, especially regression curves to spherical data. For example, Gould [1] used the corresponding angles of spherical data coordinates to introduce regression models. He considered Fisher distribution as a candidate density for the error in his analysis. A non-parametric version of his model was proposed by Thompson and Clark [2]. Usually, the data that are close to the North or South pole have different behavior. Hence, their proposed model was failing to work there, and so they tried to keep the data somewhat away from the pole via adopting their model. They advocated overcoming this problem by using the tangent plane and suggested the use of splines there [3]. On the other hand, Fisher et al. [4] proposed two families of the spherical spline for spherical data. They introduced two families of curves using differential geometry suitable for fitting the splines. One of the methods to predict statistics is to utilize non-parametric regression models. Another strategy is to consider some forms of smooth models. Both of these procedures, along with other approaches in non-Euclidean statistics context are somewhat an initiative method in analyzing the spherical data. It worths mentioning that the benefits of using the spline path by employing the rotation parameters were of interest in directional statistics in [5], albeit for circular data. One of the interesting techniques to construct the non-parametric regression model was to minimize the Euclidean risk function, first proposed in [6]. We also follow the same procedure in this paper. In particular, the primary objective of this paper is to introduce a non-parametric regression model based on minimizing the mean square errors risk function for spherical data. To apply this idea, we used the suggested method in [6] for data on the circle. We initiate our model by considering two separate models for two common angles on the sphere. Then, we impose a correlation among these angles using an appropriate risk function. The proposed models will be evaluated using simulated and real-life data. Material and methods In this paper, we presented two methods for modeling spherical data. The first one considers, separately, a regression model for each angle on the sphere. To construct a feasible model, a risk function is then suggested for modeling spherical data using Haversine distance. A non-parametric longitudinal model is derived by minimizing the proposed risk function. Hence, a parametric longitudinal model for spherical data, as the second method, is built. The estimates of the parameters in the latter model are done using the quadratic risk function. Results and discussion Some of the data sets are intrinsically on the surface of a sphere in many scientific disciplines. For example, the location of quakes on earth can be considered a point with a constant norm on a unit sphere. Many researchers paid attention to construct a proper model to analyze such data. Regression models are among popular forms of treating spherical data, statistically. In this paper, we also attempted to provide an efficient model to analyze spherical data. To aim this, we first adopted a regression model for each angle on the sphere, independently. Our methods included two different approaches; a non-parametric longitudinal regression modeling and minimizing a least square error framework to construct a parametric longitudinal model. In the first method, the Haversine distanced, and its minimization were considered. The validity of this approach was studied using simulated and real-life data. Then, regression modeling was proposed using the least-square error approach with an appropriate link function. Although the efficiency of this latter method in comparison with the former was in doubt, it was able to provide a suitable smooth paths prediction on the sphere. Moreover, the proposed method was more appropriate while using Haversine distance. The idea to increase the efficiency of the current model is using other distances having a secure connection with the least square method suitable for spherical data. Conclusion The following conclusions were drawn from this research. A non-parametric model inspired by previous models and a generalized version of it from circle to sphere was introduced. A risk function was proposed based on the Haversine distance on sphere. Two separated longitudinal models were suggested for the angles on the sphere and then a correlation was imposed using the least square risk function. Although the non-parametric method was more accurate in analyzing real data, the parametric method predicts more smooth paths../files/site1/files/51/%D9%85%D9%82%DB%8C%D9%85_%D8%A8%DB%8C%DA%AF%DB%8C.pdf
منابع مشابه
Robust Least Square Support Vector Regression for Contaminated Data Modeling
Weighted least squares support vector machine (WLSSVM) is a robust version of least squares support vector machine (LS-SVM). It adds weights on error variables to eliminate the influence of outliers. But the weights, which largely depend on the original regression errors from unweighted LS-SVM, might be unreliable for correcting the biased estimation of LS-SVM, especially for the training data ...
متن کاملRegularized Least Square Regression with Spherical Polynomial Kernels
This article considers regularized least square regression on the sphere. It develops a theoretical analysis of the generalization performances of regularized least square regression algorithm with spherical polynomial kernels. The explicit bounds are derived for the excess risk error. The learning rates depend on the eigenvalues of spherical polynomial integral operators and on the dimension o...
متن کاملConsistency for the Least Squares Estimator in Non-parametric Regression
We shall study the general regression model Y = g 0 (X) + ", where X and " are independent. The available information about g 0 can be expressed by g 0 2 G for some class G. As an estimator of g 0 we choose the least squares estimator. We shall give necessary and suucient conditions for consistency of this estimator in terms of (basically) geometric properties of G. Our main tool will be the th...
متن کاملPartial least squares methods: partial least squares correlation and partial least square regression.
Partial least square (PLS) methods (also sometimes called projection to latent structures) relate the information present in two data tables that collect measurements on the same set of observations. PLS methods proceed by deriving latent variables which are (optimal) linear combinations of the variables of a data table. When the goal is to find the shared information between two tables, the ap...
متن کاملPartial Least Square Regression PLS-Regression
PLS regression is a recent technique that generalizes and combines features from principal component analysis and multiple regression. Its goal is to predict or analyze a set of dependent variables from a set of independent variables or predictors. This prediction is achieved by extracting from the predictors a set of orthogonal factors called latent variables which have the best predictive pow...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ذخیره در منابع من قبلا به منابع من ذحیره شده{@ msg_add @}
عنوان ژورنال
دوره 5 شماره 1
صفحات 85- 97
تاریخ انتشار 2019-08
با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.
کلمات کلیدی برای این مقاله ارائه نشده است
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023